Maximum mutual information based reduction strategies for cross-correlation based joint distributional modeling

Author

  • Jeff A. Bilmes
Abstract

In maximum-likelihood based speech recognition systems, it is important to accurately estimate the joint distribution of feature vectors given a particular acoustic model. In previous work, we showed that accuracy in this task can be improved by modeling the joint distribution of time-localized feature vectors along with statistics relating those feature vectors to their surrounding context. In this work, we evaluate information-preserving reduction strategies for those statistics. We claim that the statistics corresponding to spectro-temporal loci in speech with relatively large mutual information are the most useful in estimating the information contained in the feature-vector joint distribution. Furthermore, we claim that such statistics are the most likely to generalize. Using an EM algorithm to compute mutual information between pairs of points in the time-frequency grid, we verify these hypotheses using both overlap plots and speech recognition word error results.
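As a rough illustration of ranking pairs of time-frequency points by mutual information, the Python sketch below scores every (lag, source band, destination band) locus and keeps the highest-scoring ones. It is not the EM-based estimator described in the abstract: it substitutes the closed-form mutual information of a bivariate Gaussian, I = -0.5 ln(1 - rho^2), which is only valid under a joint-Gaussian assumption. The function names `gaussian_mi` and `top_mi_pairs`, the `(frames, bands)` array layout, and the `max_lag` and `keep` parameters are all hypothetical choices made for this example.

```python
import numpy as np


def gaussian_mi(x, y):
    """Mutual information in nats between two 1-D samples.

    Assumption: x and y are jointly Gaussian, so MI depends only on
    their correlation coefficient rho: I = -0.5 * ln(1 - rho^2).
    """
    rho = np.corrcoef(x, y)[0, 1]
    rho2 = min(rho * rho, 1.0 - 1e-12)  # guard against log(0) when |rho| == 1
    return -0.5 * np.log(1.0 - rho2)


def top_mi_pairs(features, max_lag, keep):
    """Rank spectro-temporal loci by estimated mutual information.

    features: array of shape (frames, bands), one row per time frame.
    Each locus (lag, src, dst) relates band `dst` at frame t to band
    `src` at frame t - lag. Returns the `keep` highest-scoring entries
    as (mi, lag, src, dst) tuples, largest MI first.
    """
    frames, bands = features.shape
    scored = []
    for lag in range(1, max_lag + 1):
        past, present = features[:-lag], features[lag:]
        for src in range(bands):
            for dst in range(bands):
                mi = gaussian_mi(past[:, src], present[:, dst])
                scored.append((mi, lag, src, dst))
    scored.sort(reverse=True)
    return scored[:keep]
```

A mixture-based estimator fitted with EM, as the abstract describes, would capture non-Gaussian dependence that this correlation-based stand-in misses. The ranking-and-truncation step would stay the same.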


Similar resources

Joint Distributional Modeling with Cross-Correlation Based Features

In maximum-likelihood based speech recognition systems, it is important to accurately estimate the joint distribution of feature vectors given a particular acoustic model. In this work, we propose that by modeling the joint distribution of time-localized feature vectors and statistics relating those time-localized feature vectors to the relevant acoustic context, we can estimate information con...


Script Induction as Language Modeling

The narrative cloze is an evaluation metric commonly used for work on automatic script induction. While prior work in this area has focused on count-based methods from distributional semantics, such as pointwise mutual information, we argue that the narrative cloze can be productively reframed as a language modeling task. By training a discriminative language model for this task, we attain impr...


MIPA: Mutual Information Based Paraphrase Acquisition via Bilingual Pivoting

We present a pointwise mutual information (PMI) based approach for formalizing paraphrasability and propose a variant of PMI, called mutual information based paraphrase acquisition (MIPA), for paraphrase acquisition. Our paraphrase acquisition method first acquires lexical paraphrase pairs by bilingual pivoting and then reranks them by PMI and distributional similarity. The complementary nature...


Assessment of uncertainty for coal quality-tonnage curves through minimum spatial cross-correlation simulation

Coal quality-tonnage curves are helpful tools in optimum mine planning and can be estimated using geostatistical simulation methods. In the presence of spatially cross-correlated variables, traditional co-simulation methods are impractical and time consuming. This paper investigates a factor simulation approach based on minimization of spatial cross-correlations with the objective of modeling s...


Studying the Adjustment Amount of Ranking the Performance of Mutual Funds Based on Omega Ratio and Real Return

One of the main functionalities of capital market is to enhance liquidity in the market. Mutual funds are modern financial institutions which are designed with the aim of absorbing funds from investors and devote them to buy a variety of securities in order to reduce investment risks, exploit the economies of scale and finally make a reasonable return for investors. Regarding effective role of ...




Publication date: 1998